Picture for Alan Akbik

Alan Akbik

Self-Aware Knowledge Probing: Evaluating Language Models' Relational Knowledge through Confidence Calibration

Add code
Jan 26, 2026
Viaarxiv icon

Beyond Marginal Distributions: A Framework to Evaluate the Representativeness of Demographic-Aligned LLMs

Add code
Jan 22, 2026
Viaarxiv icon

What Matters When Building Universal Multilingual Named Entity Recognition Models?

Add code
Jan 09, 2026
Viaarxiv icon

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

Add code
Dec 15, 2025
Figure 1 for FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition
Figure 2 for FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition
Figure 3 for FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition
Figure 4 for FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition
Viaarxiv icon

Pre-Training Curriculum for Multi-Token Prediction in Language Models

Add code
May 28, 2025
Viaarxiv icon

Evaluating Design Decisions for Dual Encoder-based Entity Disambiguation

Add code
May 16, 2025
Viaarxiv icon

Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models

Add code
Apr 19, 2025
Figure 1 for Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models
Figure 2 for Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models
Figure 3 for Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models
Figure 4 for Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models
Viaarxiv icon

MastermindEval: A Simple But Scalable Reasoning Benchmark

Add code
Mar 11, 2025
Figure 1 for MastermindEval: A Simple But Scalable Reasoning Benchmark
Figure 2 for MastermindEval: A Simple But Scalable Reasoning Benchmark
Figure 3 for MastermindEval: A Simple But Scalable Reasoning Benchmark
Figure 4 for MastermindEval: A Simple But Scalable Reasoning Benchmark
Viaarxiv icon

BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models

Add code
Dec 20, 2024
Figure 1 for BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models
Figure 2 for BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models
Figure 3 for BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models
Figure 4 for BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models
Viaarxiv icon

Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data

Add code
Dec 13, 2024
Figure 1 for Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data
Figure 2 for Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data
Figure 3 for Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data
Figure 4 for Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data
Viaarxiv icon